Search CORE

Utilisation de grilles de calcul pour la génomique comparative

Author: Blanchet Christophe
Calvat Pascal
Cardenas Yonny
Gauthey Clement
Nief Jean Yves
Penel Simon
Spataro Bruno
Publication venue: HAL CCSD
Publication date: 19/09/2011
Field of study

International audienceLarge scale phylogenomics and comparative genomics require complex computational methods (parsimony, maximum likelihood, bayesian methods, MCMC, etc.) associated with massively distributed calculation. In this respect, grid computing plays a crucial role. Here we present how we processed exhaustive similarity searches on several millions of sequences with BLAST, using two different grids (TIDRA and GRISBI)

HAL-IN2P3

Databases of homologous gene families for comparative genomics

Author: Arigon Anne-Muriel
Daubin Vincent
Dufayard Jean-François
Duret Laurent
Gouy Manolo
Penel Simon
Perrière Guy
Sertier Anne-Sophie
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

International audienceBackground: Comparative genomics is a central step in many sequence analysis studies, from gene annotation and the identification of new functional regions in genomes, to the study of evolutionary processes at the molecular level (speciation, single gene or whole genome duplications, etc.) and phylogenetics. In that context, databases providing users high quality homologous families and sequence alignments as well as phylogenetic trees based on state of the art algorithms are becoming indispensable. Methods: We developed an automated procedure allowing massive all-against-all similarity searches, gene clustering, multiple alignments computation, and phylogenetic trees construction and reconciliation. The application of this procedure to a very large set of sequences is possible through parallel computing on a large computer cluster. Results: Three databases were developed using this procedure: HOVERGEN, HOGENOM and HOMOLENS. These databases share the same architecture but differ in their content. HOVERGEN contains sequences from vertebrates, HOGENOM is mainly devoted to completely sequenced microbial organisms, and HOMOLENS is devoted to metazoan genomes from Ensembl. Access to the databases is provided through Web query forms, a general retrieval system and a client-server graphical interface. The later can be used to perform tree-pattern based searches allowing, among other uses, to retrieve sets of orthologous genes. The three databases, as well as the software required to build and query them, can be used or downloaded from the PBIL (Pôle Bioinformatique Lyonnais) site at http://pbil.univ-lyon1.fr/

Springer - Publisher Connector

Hal-Diderot

Repository/R-Forge/DateTimeStamp 2012-12-11 16:03:18

Author: Anamaria Necsulea
Delphine Charif
Depends R
Guy Perriere
Jean R. Lobry
Lazydata Yes
Leonor Palmeira
Needscompilation Yes
Simon Penel
Publication venue
Publication date
Field of study

Suggests ade4, segmented Description Exploratory data analysis and data visualization for biological sequence (DNA and protein) data. Include also utilities for sequence data management under the ACNUC system. License GPL (> = 2

CiteSeerX

Utilisation de grilles de calcul pour la génomique comparative

Author: Blanchet Christophe
Calvat Pascal
Cardenas Yonny
Gauthey Clement
Nief Jean Yves
Penel Simon
Spataro Bruno
Publication venue: HAL CCSD
Publication date: 19/09/2011
Field of study

PhEVER: a database for the global exploration of virus–host evolutionary relationships

Author: Altschul
Arigon
Baltimore
Castresana
Chantal Rabourdin-Combe
Charif
Christian Gautier
Clamp
Edgar
Ehlers
Fauquet
Filée
Finn
Gouy
Gouy
Graham
Guindon
Hambly
Holmes
Hubbard
Hughes
Jones
Klimke
Leonor Palmeira
Marchler-Bauer
Moore
Patthy
Penel
Pruitt
Sayers
Shackelton
Sharp
Shimodaira
Simon Penel
Sterk
Tian
Trifonov
UniProt Consortium
Vincent Lotteau
Wootton
Zmasek
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Fast viral adaptation and the implication of this rapid evolution in the emergence of several new infectious diseases have turned this issue into a major challenge for various research domains. Indeed, viruses are involved in the development of a wide range of pathologies and understanding how viruses and host cells interact in the context of adaptation remains an open question. In order to provide insights into the complex interactions between viruses and their host organisms and namely in the acquisition of novel functions through exchanges of genetic material, we developed the PhEVER database. This database aims at providing accurate evolutionary and phylogenetic information to analyse the nature of virus–virus and virus–host lateral gene transfers. PhEVER (http://pbil.univ-lyon1.fr/databases/phever) is a unique database of homologous families both (i) between sequences from different viruses and (ii) between viral sequences and sequences from cellular organisms. PhEVER integrates extensive data from up-to-date completely sequenced genomes (2426 non-redundant viral genomes, 1007 non-redundant prokaryotic genomes, 43 eukaryotic genomes ranging from plants to vertebrates) and offers a clustering of proteins into homologous families containing at least one viral sequences, as well as alignments and phylogenies for each of these families. Public access to PhEVER is available through its webpage and through all dedicated ACNUC retrieval systems

CiteSeerX

Open Repository and Bibliography - Liège

Hal-Diderot

Integr8 and Genome Reviews: integrated views of complete genomes and proteomes

Author: Apweiler Rolf
Bower Lawrence
Das Ujjwal
Duggan Karyn
Duret Laurent
Faruque Nadeem
Gattiker Alexandre
Horne Alan
Kanapin Alexander
Kanz Carola
Kersey Paul
Kulikova Tamara
Mclaren Peter
Michoud Karine
Morris Lorna
Penel Simon
Petryszak Robert
Phan Isabelle
Reimholz Britt
Reuter Ingmar
Publication venue
Publication date: 02/08/2017
Field of study

Integr8 is a new web portal for exploring the biology of organisms with completely deciphered genomes. For over 190 species, Integr8 provides access to general information, recent publications, and a detailed statistical overview of the genome and proteome of the organism. The preparation of this analysis is supported through Genome Reviews, a new database of bacterial and archaeal DNA sequences in which annotation has been upgraded (compared to the original submission) through the integration of data from many sources, including the EMBL Nucleotide Sequence Database, the UniProt Knowledgebase, InterPro, CluSTr, GOA and HOGENOM. Integr8 also allows the users to customize their own interactive analysis, and to download both customized and prepared datasets for their own use. Integr8 is available at http://www.ebi.ac.uk/integr

RERO DOC Digital Library

Ultra-fast sequence clustering from similarity networks with SiLiX

Author: A Krishnamurthy
AJ Enright
AJ Vilella
AY Signorovitch
F Servant
H Li
HJ Atkinson
I Katriel
J Ruan
JL Boore
JM Joseph
KD Pruitt
Laurent Duret
MH Alsuwaiyel
PK Wall
PS Dehal
R Petryszak
R Tarjan
RD Finn
RE Tarjan
S Hartmann
S Hunter
S Penel
S Vishwanathan
SF Altschul
Simon Penel
SK Das
T Meinel
T Wittkop
Vincent Miele
Y Bramoulle
Y Han
Y Loewenstein
Y Tian
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The number of gene sequences that are available for comparative genomics approaches is increasing extremely quickly. A current challenge is to be able to handle this huge amount of sequences in order to build families of homologous sequences in a reasonable time. Results We present the software package <monospace>SiLiX</monospace> that implements a novel method which reconsiders single linkage clustering with a graph theoretical approach. A parallel version of the algorithms is also presented. As a demonstration of the ability of our software, we clustered more than 3 millions sequences from about 2 billion BLAST hits in 7 minutes, with a high clustering quality, both in terms of sensitivity and specificity. Conclusions Comparing state-of-the-art software, <monospace>SiLiX</monospace> presents the best up-to-date capabilities to face the problem of clustering large collections of sequences. <monospace>SiLiX</monospace> is freely available at <url>http://lbbe.univ-lyon1.fr/SiLiX</url>.</p

Directory of Open Access Journals

Integr8 and Genome Reviews: integrated views of complete genomes and proteomes

Author: Apweiler Rolf
Bower Lawrence
Das Ujjwal
Duggan Karyn
Duret Laurent
Faruque Nadeem
Gattiker Alexandre
Horne Alan
Kanapin Alexander
Kanz Carola
Kersey Paul
Kulikova Tamara
Mclaren Peter
Michoud Karine
Morris Lorna
Penel Simon
Petryszak Robert
Phan Isabelle
Reimholz Britt
Reuter Ingmar
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

Performance status is the most powerful risk factor for early death among patients with advanced soft tissue sarcoma The European Organisation for Research and Treatment of Cancer – Soft Tissue and Bone Sarcoma Group (STBSG) and French Sarcoma Group (FSG) study

Author: A Duhamel
A Linden
B Bui
C Clément-Duchêne
C Evans
C Ferté
C Sessa
CA Barton
CM Parkes
EC Borden
F Chan
HT Arkenau
HT Arkenau
I Judson
I Ray-Coquard
J Fayette
J Maurel
J-Y Blay
JM Geraci
JY Blay
K Antman
L Kelly
LA Melchior
M Ando
M Maltoni
M Ouali
M V Glabbeke
M Van Glabbeke
MA Clark
N Ambalavanan
N Penel
N Penel
N Penel
NA Christakis
P Hohenberger
P Schoffski
R Simon
RS Benjamin
S Marreaud
S Mathoulin-Pelissier
S Sleijfer
S Sleijfer
V Brouste
V Karavasilis
XF Courville
Publication venue: Nature Publishing Group
Publication date: 01/01/2011
Field of study

BACKGROUND: We investigated prognostic factors (PFs) for 90-day mortality in a large cohort of advanced/metastatic soft tissue sarcoma (STS) patients treated with first-line chemotherapy. METHODS: The PFs were identified by both logistic regression analysis and probability tree analysis in patients captured in the Soft Tissue and Bone Sarcoma Group (STBSG) database (3002 patients). Scores derived from the logistic regression analysis and algorithms derived from probability tree analysis were subsequently validated in an independent study cohort from the French Sarcoma Group (FSG) database (404 patients). RESULTS: The 90-day mortality rate was 8.6 and 4.5% in both cohorts. The logistic regression analysis retained performance status (PS; odds ratio (OR) = 3.83 if PS = 1, OR = 12.00 if PS >= 2), presence of liver metastasis (OR = 2.37) and rare site metastasis (OR = 2.00) as PFs for early death. The CHAID analysis retained PS as a major discriminator followed by histological grade (only for patients with PS >= 2). In both models, PS was the most powerful PF for 90-day mortality. CONCLUSION: Performance status has to be taken into account in the design of further clinical trials and is one of the most important parameters to guide patient management. For those patients with poor PS, expected benefits from therapy should be weighed up carefully against the anticipated toxicities. British Journal of Cancer (2011) 104, 1544-1550. doi: 10.1038/bjc.2011.136 www.bjcancer.com Published online 19 April 2011 (C) 2011 Cancer Research U